Problem Note 54481: PROC HPTMINE might retain some Punctuation terms that have an attribute of Mixed

In SAS® Text Miner, the HPTMINE procedure (and therefore the HP Text Miner node) does not correctly drop all terms that have a role of Punctuation. If the punctuation terms have an attribute of Mixed, PROC HPTMINE fails to drop them. This problem occurs because PROC HPTMINE handles the characters using a LATIN1 encoding instead of a WLATIN1 encoding. WLATIN1 (Windows-1252) assigns printable characters, including curly quotation marks and dashes, to the code points 0x80 through 0x9F, which LATIN1 (ISO 8859-1) reserves for control characters.

There are no errors or warnings to indicate a problem.
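
The effect of the encoding choice on punctuation can be illustrated outside SAS. The following Python sketch is not SAS code and does not reproduce the internals of PROC HPTMINE; it assumes only that SAS LATIN1 and WLATIN1 correspond to the Python codecs latin-1 (ISO 8859-1) and cp1252 (Windows-1252). The byte values and the helper name classify are chosen here for illustration. The sketch decodes the same bytes under both encodings and reports the Unicode general category of each resulting character.

```python
import unicodedata

# Bytes that Windows-1252 maps to typographic punctuation:
# ellipsis, single and double curly quotes, en dash, em dash.
SAMPLE_BYTES = bytes([0x85, 0x91, 0x92, 0x93, 0x94, 0x96, 0x97])


def classify(raw: bytes, encoding: str) -> list[tuple[str, str]]:
    """Decode raw with the given codec and return (code point, category) pairs."""
    text = raw.decode(encoding)
    return [(f"U+{ord(ch):04X}", unicodedata.category(ch)) for ch in text]


# Hypothetical stand-ins for the two SAS session encodings.
wlatin1_view = classify(SAMPLE_BYTES, "cp1252")
latin1_view = classify(SAMPLE_BYTES, "latin-1")

for byte, w, l in zip(SAMPLE_BYTES, wlatin1_view, latin1_view):
    print(f"0x{byte:02X}  cp1252 -> {w[0]} ({w[1]})   latin-1 -> {l[0]} ({l[1]})")
```

Under cp1252 every sample byte decodes to a character in a punctuation category (a category code starting with P), while under latin-1 the same bytes decode to control characters (category Cc). A term containing such bytes could therefore plausibly be classified differently depending on which encoding is applied.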

To work around the problem, add the following line to your sasv9.cfg file:

-ENCODING WLATIN1


Operating System and Release Information

Product Family: SAS System
Product: SAS Text Miner

System                                        Product Release     SAS Release
                                              Reported   Fixed*   Reported    Fixed*
Microsoft® Windows® for x64                   12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Microsoft Windows 8 Enterprise x64            12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Microsoft Windows 8 Pro x64                   12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Microsoft Windows 8.1 Enterprise 32-bit       12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Microsoft Windows 8.1 Enterprise x64          12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Microsoft Windows 8.1 Pro                     12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Microsoft Windows 8.1 Pro 32-bit              12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Microsoft Windows Server 2008 R2              12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Microsoft Windows Server 2008 for x64         12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Microsoft Windows Server 2012 Datacenter      12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Microsoft Windows Server 2012 R2 Datacenter   12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Microsoft Windows Server 2012 R2 Std          12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Microsoft Windows Server 2012 Std             12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Windows 7 Enterprise x64                      12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Windows 7 Professional x64                    12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
64-bit Enabled AIX                            12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
64-bit Enabled Solaris                        12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
HP-UX IPF                                     12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Linux for x64                                 12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0
Solaris for x64                               12.1_M1    12.3     9.3 TS1M2   9.4 TS1M0

* For software releases that are not yet generally available, the Fixed Release is the software release in which the problem is planned to be fixed.